PCR AMPLIFICATION OF REARRANGED GENOMIC VARIABLE
REGIONS OF IMMUNOGLOBULIN GENES
Background of the Invention 
The development of mouse hybridoma technology has
allowed the production of antibodies (Ab) specific for a
wide range of antigens. Mouse monoclonal antibodies
(mAb) have been used extensively for diagnosis and, in a
few cases, for human therapy and in vivo diagnostics.
Administration of murine antibodies to humans has been
observed to induce a strong human anti-mouse antibody
response (HAMA) after single or repeated treatments, thus
precluding long-term treatment using these antibodies.
Moreover, rodent antibodies are rapidly cleared from
human serum and often do not interact effectively with
the human immune system. Since human hybridomas are
generally unstable and secrete low amounts of antibodies
(frequently IgMs), considerable effort has been directed
at rendering foreign antibodies (e.g., murine antibodies)
more similar to those of the host to which they are
administered (e.g., a human). Alternatives to human
hybridoma-derived antibodies have been developed in which
mouse immunoglobulin sequences (e.g., constant regions)
are replaced with corresponding sequences derived from
human immunoglobulin genes. Two examples of this type of
antibody are (1) chimeric mAbs, in which murine variable
regions are combined with human constant regions
(Morrison et al., Proc. Natl. Acad. Sci. USA 81:6851-6855,
1984; Boulianne et al., Nature 312:643-646, 1984),
and (2) humanized mAbs, in which murine CDRs
(complementarity determining regions) replace the
corresponding sequences in human immunoglobulins (Jones
et al., Nature 321:522-525, 1986; Co et al., Nature
351:501-502, 1991). These engineered antibodies retain
their target specificity and generally exhibit reduced
HAMA responses when injected into patients. In addition,
desired effector functions of antibodies for certain
clinical applications can be obtained by using constant
regions corresponding to the appropriate immunoglobulin
isotype.

Despite these advances, cloning of variable region
sequences has been a limiting step in the rapid
construction of chimeric and isotype switched antibodies.
Polymerase chain reaction (PCR) amplification of
immunoglobulin heavy and light chain variable regions has
facilitated this step. However, the high degree of DNA
sequence polymorphism in leader and variable sequences of
both heavy and light chain genes has required the
preparation of complex sets of degenerate primers (Jones
et al., Bio/Technology 9:88-89, 1991; Kettleborough et
al., Eur. J. Immunol. 23:206-211, 1993; Le Boeuf et al.,
Gene 82:371-377, 1989; Orlandi et al., Proc. Natl. Acad.
Sci. USA 86:3833-3837, 1989). In the case of 5' primers,
these primers have usually been designed to correspond to
the first framework of the variable region (FR1) and, in
a few cases, to the leader peptide sequence (L). The 3'
primers have usually been designed to correspond to the
framework 4 (FR4) region, which displays limited
polymorphism, or to the constant region, in which
conserved, isotype-specific sequences are easily
identified. Although complex sets of 5' and 3' primers
have been designed, they do not always match the DNA
template completely (Gavilondo-Cowley et al., Hybridoma
9:407-417, 1990; Leung et al., BioTechniques 15:286-292,
1993). Native sequences of the immunoglobulin heavy and
light chain genes may therefore be altered in the FR1
and/or FR4 regions by the PCR amplification process.
Modifications of the N-terminal region of an
immunoglobulin, particularly the light chain variable
region (VL), in which the amino acid at position two is
part of the predicted canonical structure for CDR1
(Chothia et al., Nature 342:877-883, 1989), have been
shown to drastically reduce the affinity of
immunoglobulins for their antigens. Moreover, expression
levels of the recombinant antibodies may also be altered
when mutations occur in the leader peptide. In most
studies involving PCR amplification of immunoglobulin H
(heavy) and κ/λ (light) chain variable regions using
these primers, cDNA templates were used, resulting in the
generation of fragments containing incomplete VH and VL
sequences, which may or may not be linked to part of the
constant region.

Summary of the Invention 
We have designed a method for isolating nucleic
acids encoding immunoglobulin Fv (variable) fragments
from genomic DNA of hybridoma cells producing specific
monoclonal antibodies. Specific primers corresponding to
(1) the 5' untranslated region (UTR) of the variable
region and (2) the intron downstream of the rearranged
JH/Jκ/λ sequences are used in this method. The method
can be used to amplify and clone genomic DNA
corresponding to λ and κ light chain variable genes, as
well as heavy chain variable genes. The variable genes
isolated by this method can easily be inserted into
expression vectors containing heterologous (e.g., human)
light and heavy chain constant genes, thus facilitating
isotype switching or antibody chimerization. Using this
method, we have cloned for the first time genes encoding
the variable regions (Fv) of the kappa light chain and
heavy chain of the antibody produced by hybridoma cell
line HNK-20.

Accordingly, in one aspect the invention features
substantially pure DNA (genomic DNA or cDNA) encoding a
variable region of the antibody produced by hybridoma
cell line HNK-20. The variable region can be from the
immunoglobulin heavy chain of the antibody, or from the
immunoglobulin light chain of the antibody. The DNA may
further encode an immunoglobulin constant region, such as
a human immunoglobulin constant region. The
immunoglobulin can be of any isotype, including, but not
limited to, an IgA (e.g., IgA1, IgA2, and sIgA), IgG, IgM,
IgD, or IgE isotype. In the case of an IgA isotype, the
immunoglobulin heavy chain can be an α chain.
In one embodiment, the substantially pure DNA
contains the sequence of Fig. 5B, or degenerate variants
thereof, and encodes the amino acid sequence of Fig. 5B.
In another embodiment, the substantially pure DNA
contains a sequence having about 50% or greater sequence
identity to the DNA sequence of Fig. 5B. In another
embodiment, the substantially pure DNA a) is capable of
hybridizing to the DNA sequence of Fig. 5B under
stringent conditions; and b) encodes a polypeptide having
a biological activity of a HNK-20 variable region.

In another embodiment, the substantially pure DNA
contains the sequence of Fig. 5C, or degenerate variants
thereof, and encodes the amino acid sequence of Fig. 5C.
In another embodiment, the substantially pure DNA
contains a sequence having about 50% or greater sequence
identity to the DNA sequence of Fig. 5C. In another
embodiment, the substantially pure DNA a) is capable of
hybridizing to the DNA sequence of Fig. 5C under
stringent conditions; and b) encodes a polypeptide having
a biological activity of a HNK-20 variable region.

In another embodiment, the substantially pure DNA
contains the sequence of Fig. 5D, or degenerate variants
thereof, and encodes the amino acid sequence of Fig. 5D.
In another embodiment, the substantially pure DNA
contains a sequence having about 50% or greater sequence
identity to the DNA sequence of Fig. 5D. In another
embodiment, the substantially pure DNA a) is capable of
hybridizing to the DNA sequence of Fig. 5D under
stringent conditions; and b) encodes a polypeptide having
a biological activity of a HNK-20 variable region.
In another aspect of the invention, the DNA is
operably linked to regulatory sequences, such as promoter
and/or enhancer sequences, for expression of the variable
region. In a related aspect, the invention features a
vector (e.g., a plasmid or a viral vector) containing the
DNA of the invention operably linked to a promoter
sequence. The invention also features a cell (e.g., a
myeloma cell) containing the DNA of the invention.

In another aspect, the invention features a
recombinant antibody containing a variable region from
the monoclonal antibody produced by hybridoma cell line
HNK-20. In one embodiment, the variable region is from
the immunoglobulin heavy chain of the monoclonal
antibody. In another embodiment, the variable region is
from the immunoglobulin light chain of the monoclonal
antibody. The recombinant antibody may further contain a
chimeric constant region, e.g., a human immunoglobulin
constant region. The antibody of the invention may be of
any immunoglobulin isotype, such as those listed above.
Accordingly, the antibody of the invention may contain an
α heavy chain, and thus be of the IgA isotype.

The invention also features a method of making a
recombinant antibody containing a variable region from
the monoclonal antibody produced by hybridoma cell line
HNK-20. This method involves expression of DNA encoding
the variable region of the antibody, as well as a
constant region (e.g., a human constant region).
Expression of the DNA may be carried out using any
standard method known in the art. Preferably, the DNA is
expressed in a cell, e.g., a myeloma cell. The cell is
cultured under conditions in which the recombinant
antibody is produced, and the antibody is subsequently
purified from the cell or from the supernatant in which
the cell was cultured, using standard methods.

In a final aspect, the invention features a method
of isolating a nucleic acid containing a variable region
of an immunoglobulin gene (e.g., the immunoglobulin gene
is from hybridoma cell line HNK-20). In this method, a
first set of primers, each of which contains the sequence
of a polymorphic variant of a segment of the 5'
untranslated region of said immunoglobulin gene, and a
second set of primers, each of which contains the
sequence of a polymorphic variant of a segment of the
intron 3' to the rearranged J region of said
immunoglobulin gene, are used in a polymerase chain
reaction containing genomic DNA that encodes the variable
region of the immunoglobulin. A set of primers that
contain sequences of the immunoglobulin gene is
identified from the polymerase chain reaction, and
subsequently is used to amplify the immunoglobulin gene.

By "promoter" is meant a minimal sequence element 
sufficient to direct transcription. Also included in the 
invention are those promoter elements which are 
sufficient to render promoter-dependent gene expression 
controllable for cell-type or tissue-specific expression, 
as well as elements which allow expression to be 
inducible by external signals or agents; such elements 
may be located in the 5' or 3' regions of, as well as 
within, the native gene. 

By "operably linked" is meant that a gene and a
regulatory sequence(s) are connected in such a way as to
permit gene expression when the appropriate molecules
(e.g., transcriptional activator proteins) are bound to
the regulatory sequence(s).



By "substantially identical" is meant a
polypeptide or nucleic acid exhibiting at least 50%,
preferably 85%, more preferably 90%, and most preferably
95% homology to a reference amino acid or nucleic acid
sequence. For polypeptides, the length of comparison
sequences will generally be at least 16 amino acids,
preferably at least 20 amino acids, more preferably at
least 25 amino acids, and most preferably 35 amino acids.
For nucleic acids, the length of comparison sequences
will generally be at least 30 nucleotides, preferably at
least 50 nucleotides, more preferably at least 75
nucleotides, and most preferably 110 nucleotides.
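
The thresholds in this definition can be illustrated with
a minimal sketch. It assumes that the two sequences have
already been aligned to equal length, with "-" marking
gaps, and it treats "homology" as simple position-by-
position identity over the ungapped columns. Both are
simplifying assumptions, and the function names are
hypothetical; this is not the scoring performed by the
sequence analysis software cited in this document.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of ungapped aligned positions at which two sequences match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # assumption: gap columns are excluded from the comparison
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def is_substantially_identical(aligned_a: str, aligned_b: str,
                               threshold: float = 50.0,
                               min_length: int = 30) -> bool:
    """Apply an identity threshold and a minimum comparison length.

    Defaults follow the broadest values in the text for nucleic acids
    (at least 50% over at least 30 nucleotides); for polypeptides the
    corresponding minimum length in the text is 16 amino acids.
    """
    compared = sum(1 for a, b in zip(aligned_a, aligned_b)
                   if a != "-" and b != "-")
    if compared < min_length:
        return False
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Passing threshold=95.0 and min_length=110 would apply the
most preferred values given above for nucleic acids.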

Sequence identity is typically measured using
sequence analysis software (e.g., Sequence Analysis
Software Package of the Genetics Computer Group,
University of Wisconsin Biotechnology Center, 1710
University Avenue, Madison, WI 53705; Ausubel et al.,
eds. Current Protocols in Molecular Biology. Wiley &
Sons, New York, 1989). Such software matches similar
sequences by assigning degrees of homology to various
substitutions, deletions, and other
modifications. Conservative substitutions for amino
acids typically include substitutions within the
following groups: glycine, alanine; valine, isoleucine,
leucine; aspartic acid, glutamic acid, asparagine,
glutamine; serine, threonine; lysine, arginine; and
phenylalanine, tyrosine.
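
The conservative substitution groups listed above can be
written out as a simple lookup. The sketch below is an
illustration only: the one-letter amino acid codes are
standard, but the names CONSERVATIVE_GROUPS and
is_conservative are hypothetical, and the check does not
reproduce the graded homology scores that sequence
analysis software assigns.

```python
# Groups taken from the text, written with one-letter amino acid codes.
CONSERVATIVE_GROUPS = [
    {"G", "A"},            # glycine, alanine
    {"V", "I", "L"},       # valine, isoleucine, leucine
    {"D", "E", "N", "Q"},  # aspartic acid, glutamic acid, asparagine, glutamine
    {"S", "T"},            # serine, threonine
    {"K", "R"},            # lysine, arginine
    {"F", "Y"},            # phenylalanine, tyrosine
]


def is_conservative(residue_a: str, residue_b: str) -> bool:
    """Return True if swapping one residue for the other stays within a group."""
    a, b = residue_a.upper(), residue_b.upper()
    if a == b:
        return False  # identical residues are not a substitution
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)
```

For example, valine to leucine falls within one group,
whereas glycine to valine crosses two groups and would not
count as conservative under this list.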

By "substantially pure DNA" is meant DNA that is
free of the genes which, in the naturally-occurring
genome of the organism from which the DNA is derived,
flank the gene. The term therefore includes, for
example, a recombinant DNA which is incorporated into a
vector; into an autonomously replicating plasmid or
virus; or into the genomic DNA of a prokaryote or
eukaryote; or which exists as a separate molecule (e.g.,
a cDNA or a genomic or cDNA fragment produced by PCR or
restriction endonuclease digestion) independent of other
sequences. It also includes a recombinant DNA which is
part of a hybrid gene encoding additional polypeptide
sequence.

"Stringent conditions", as used herein, are
defined as follows. High stringency conditions include
hybridization at about 42°C in about 50% formamide; a
first wash at about 65°C in about 2x SSC and 1% SDS;
followed by a second wash at about 65°C in about 0.1x
SSC. Lower stringency conditions for genes having about
50% sequence identity include hybridization at about 42°C
in the absence of formamide; a first wash at about 42°C
in about 6x SSC and about 1% SDS; followed by a second
wash at about 50°C in about 6x SSC and about 1% SDS.

Our method for isolating immunoglobulin variable
region genes, using genomic DNA templates, does not lead
to the production of fragments that need to be adapted
for recombinant antibody expression, thus facilitating
the generation of chimeric and isotype-switched
immunoglobulins. Variable regions with intact coding
sequences, including full length leader peptides, are
obtained using this method without requiring previous DNA
sequencing. Thus, isotype switched mouse Ig and chimeric
mouse-human Ig can easily be produced. Chimeric
antibodies containing constant regions derived from the
host to which the antibody is to be administered (e.g., a
human) are advantageous for use in therapeutics, because
such chimeric antibodies are less likely than
heterologous antibodies (e.g., murine antibodies) to lead
to an adverse immune response, e.g., an HAMA response, in
the patient.

Other features and advantages of the invention
will be apparent from the following detailed description
and from the claims.

Detailed Description
The drawings are first described. 

Figs. 1A-1C are schematic representations of
rearranged genes encoding the variable regions of heavy
(Fig. 1A), kappa (Fig. 1B), and lambda (Fig. 1C) mouse
immunoglobulin chains. Pseudogenes are indicated
by "ψ"s. Bold arrows indicate PCR primers corresponding
to the 5' untranslated region and the intron downstream
of the rearranged J segment. SalI and NotI restriction
sites have been added to the 5' ends of the PCR primers.
Arrows below the J segments represent oligonucleotide
probes used in Northern and Southern blot hybridization.

Figs. 2A and 2B are Northern blots of RNA isolated
from hybridoma cell lines 3G3 and HNK-20 probed with
oligonucleotides homologous to the JH, Jκ, and Jλ
segments. Fig. 2A: Total RNA (10 µg per slot) from
hybridoma cell line 3G3 (IgM, λ) was hybridized with
oligonucleotides Jλ1, Jλ2, and Jλ3, as indicated in the
figure. Fig. 2B: Total RNA (10 µg per slot) from HNK-20
(IgA, κ) was hybridized with Jκ1, Jκ2, Jκ4, and Jκ5; and
JH1, JH2, JH3, and JH4 (see Fig. 6 for the sequence of
the oligonucleotide probes), as indicated.
Oligonucleotides specific for the pseudogenes (Jκ3 and
Jλ4) were not used in these experiments. As an internal
control for total RNA loading, the blots were
rehybridized with a full length glyceraldehyde-3-
phosphate-dehydrogenase (GAPDH) cDNA probe (McMaster et
al., Proc. Natl. Acad. Sci. USA 74:4835-4838, 1977).
Migration positions of ribosomal RNA are indicated.

Figs. 3A-3G are Southern blots of amplified 3G3
Vλ, HNK-20 Vκ, and HNK-20 VH genomic fragments, made by
following a two step PCR amplification method. In the
first step (Figs. 3A-3D), PCR amplification was carried
out using groups of 2, 3, or 4 5' primers (see Fig. 6)
and a single 3' primer. The 3' primer for Vλ was 3'λ1;
for Vκ, 3'κ2; and for VH, 3'H3a or 3'H3b. In the second
step (Figs. 3E-3G), PCR amplification was performed
separately with each of the 5' primers from the positive
groups from the first set of reactions (Figs. 3A-3D),
allowing identification of the 5' primer(s) that
generate(s) the signal in each positive group. PCR
amplification with 3'H3a was omitted in the second step
since 3'H3b generates stronger signals. Positions of
size markers of 564, 831, and 947 basepairs are indicated
in the lane labelled M1.

Figs. 4A-4B are schematic representations of the
rearranged 3G3 Vλ, HNK-20 Vκ, and VH regions. In Fig. 4A
rearrangement of the lambda locus from 3G3 is shown. In
Fig. 4B rearrangements of the kappa and heavy chain loci
from HNK-20 are shown. Three different kappa chain gene
rearrangements have been observed with the Jκ2 segment.
For the heavy chain genes, 5'H31a and 5'H31b generate the
same PCR product, hence a single rearrangement is
observed for the heavy chain. The maps are not drawn to
scale.

Figs. 5A-5D are sequences of the PCR amplified 3G3
Vλ, HNK-20 Vκ, and HNK-20 VH regions. The sequences
shown in Figs. 5A, 5B, 5C, and 5D correspond to 3G3 Vλ
(generated by 5'λ1-3'λ1), HNK-20 Vκ (generated by
5'κ16-3'κ2), HNK-20 Vκ (generated by 5'κ30-3'κ2), and
HNK-20 VH (generated by 5'H31b-3'H3b), respectively. The
sequences of the PCR primers are shown in bold, the CDR
regions are underlined, and stop codons are indicated by
stars. Regions homologous to the Jλ, Jκ, and JH
oligonucleotide probes are double underlined.

Figs. 6A-6F are the nucleotide sequences of the
PCR primers and probes used for the amplification of
mouse Vλ, Vκ, and VH regions. SalI and NotI sites are in
bold, and I represents inosine. Sequences are 5' to 3'.
The names of the groups of 5' primers precede the
numbering and sequences of the primers.

The Monoclonal Antibody Produced by Hybridoma Cell Line

Respiratory Syncytial Virus (RSV) infection can
lead to diseases of the upper or lower respiratory tract,
including pneumonia and severe bronchiolitis (McIntosh et
al., In B. N. Fields and D. M. Knipe (eds.), Virology.
Raven Press, New York, pp. 1045-1072, 1990). HNK-20 is a
murine hybridoma cell line that produces a monoclonal IgA
antibody that recognizes the F glycoprotein of RSV. In
addition to neutralizing RSV in plaque reduction assays,
this antibody is effective at preventing and treating RSV
infection when passively administered to mice
intranasally.



Method for Isolating Immunoglobulin Variable Region
Genes

We have designed a strategy for the preparation of
genomic fragments encoding the VH-D-JH (immunoglobulin
heavy chain variable region) and VL-JL (immunoglobulin
light chain variable region) regions of immunoglobulin
genes. This strategy involves PCR amplification of
genomic DNA prepared from hybridomas using specific
primers corresponding to (1) the 5' untranslated region
of the gene encoding the variable region, and (2) an
intron downstream from specific rearranged JH/JL
sequences. This method does not require previous
knowledge of the sequence of any part of the gene
encoding the target variable region, thus allowing rapid
and efficient isolation of the genes.

Based on analysis of nucleotide sequence databases
(e.g., Genbank), sets of primers are designed
corresponding to all of the sequences reported in the
databases for a given organism (e.g., a mouse) for a
specific region within the 5' UTR of the immunoglobulin
variable region gene and for a specific region in the
intron downstream of the rearranged J segment. In order
to minimize the number of primers required to cover all
of the known sequences for a given region (e.g.,
the 5' UTR), the primers may be targeted to the most
highly conserved segments within that region. As a
specific example, in the case of the 5' UTR of the murine
variable heavy chain, 42 primers are required because of
the relatively high degree of nucleotide sequence
polymorphism (see below). In contrast, due to lower
levels of sequence polymorphism, only 2 primers are
required to cover all of the known possibilities for the
5' UTR of the murine immunoglobulin lambda variable
region (see below). In a first set of PCR reactions,
multiple primers covering all of the polymorphic
sequences corresponding to the region the primers
hybridize to can be used in a single reaction. In cases
where a large number of primers are required, the primers
can be divided up into several reactions based on, e.g.,
their melting temperatures. For example, for the 5' UTR
of the murine heavy chain variable region, where 42
primers are required to cover all of the known
polymorphic sequences, the reactions were divided up into
12 groups (see below). Primers contained in those
reactions of the first set of PCR reactions found to be
positive can subsequently be tested individually (i.e.,
in reactions containing only a single 5' primer and a
single 3' primer) in order to identify the sets of
primers that correspond to the sequence of the gene being
amplified. These primers can then be used to prepare the
fragment for cloning, in order to generate chimeric or
isotype switched immunoglobulin genes. As no sequence
information for the gene being amplified is required in
the present method, isolating variable region genes from
hybridoma cell lines is facilitated.
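
The bookkeeping behind this two step screen can be
sketched as follows. This is an illustration under stated
assumptions, not the procedure used to form the actual
primer groups: melting temperatures are estimated with the
Wallace rule (2°C per A or T, 4°C per G or C), which is
only a rough approximation for short oligonucleotides and
ignores inosine; primers are simply sorted by that
estimate and cut into pools of a fixed size; and all
function names and primer sequences are hypothetical.

```python
def wallace_tm(primer: str) -> int:
    """Rough melting temperature (deg C) by the Wallace rule: 2(A+T) + 4(G+C).

    Bases other than A, C, G, and T (e.g., inosine, I) contribute nothing here.
    """
    s = primer.upper()
    return 2 * (s.count("A") + s.count("T")) + 4 * (s.count("G") + s.count("C"))


def pool_by_tm(primers: dict, pool_size: int = 4) -> list:
    """Step 1 setup: sort primer names by estimated Tm and cut into pools."""
    ordered = sorted(primers, key=lambda name: (wallace_tm(primers[name]), name))
    return [ordered[i:i + pool_size] for i in range(0, len(ordered), pool_size)]


def second_round(pools: list, positive_pools: list, three_prime: str) -> list:
    """Step 2 setup: one reaction per 5' primer from each positive pool,
    each paired with the single 3' primer."""
    return [(five_prime, three_prime)
            for index in positive_pools
            for five_prime in pools[index]]


if __name__ == "__main__":
    # Hypothetical 5' primers (not taken from Fig. 6).
    five_prime_primers = {
        "p1": "AAAA", "p2": "GGGG", "p3": "ATAT", "p4": "GCGC", "p5": "AAGG",
    }
    pools = pool_by_tm(five_prime_primers, pool_size=2)
    print(pools)
    # Suppose only the second pool gave a signal in the first round.
    print(second_round(pools, [1], "3prime"))
```

Each pair returned by second_round corresponds to one
single-primer-pair reaction of the second step; the pairs
that give a signal identify the primers matching the
rearranged gene.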

Variable regions isolated by this method can be
fused to genes encoding heterologous constant regions in
order to produce chimeric (e.g., a murine variable region
fused to a human constant region) or isotype switched
immunoglobulin genes, that can in turn be used to make
chimeric or isotype switched antibodies. We have used
this method to produce chimeric immunoglobulin genes,
including one containing the HNK-20 heavy chain variable
region fused to a human heavy chain constant region, and
another containing the HNK-20 kappa light chain variable
region fused to a human kappa light chain constant
region. Co-expression of these genes leads to the
production of recombinant antibodies having the
specificity of the antibody produced by hybridoma cell
line HNK-20, and the effector regions of a human
antibody.

The recombinant antibodies of the invention can be
used to prevent RSV infection, or to treat diseases
caused by RSV infection (e.g., upper or lower respiratory
tract infections, such as pneumonia or bronchiolitis).
The recombinant antibodies of the invention may be
administered to any hosts that are susceptible to RSV
infection, including, but not limited to, humans (adults,
children, and infants), chimpanzees, cattle, cebus
monkeys, owl monkeys, ferrets, lambs, mice, rats, and
cotton rats. The therapeutic compositions of the
invention may be administered to a patient by any
appropriate mode. Typically, the antibody is
administered to a mucosal surface of the subject, for
example, an oral or an intranasal surface. The
antibodies of the invention can be administered in an
amount determined to be effective for prevention or
treatment of RSV infection by one skilled in the art. An
appropriate dosage is one which effects a reduction in
the disease caused by RSV infection, and/or one which is
effective at preventing RSV infection. It is expected
that the dosages will vary, depending upon the mode and
route of administration; the age, weight, and health of
the recipient; the nature and extent of the disease; the
frequency and duration of the treatment; the type, if
any, of concurrent therapy; and the desired effect. For
example, the amount of antibody administered may be in
the range of 50 ng/kg to 5 mg/kg body weight. The
invention also includes compositions containing the
antibodies of the invention in a pharmaceutically
acceptable carrier and/or diluent, e.g., saline.
Suitable pharmaceutical carriers, as well as
pharmaceutical necessities for use in pharmaceutical
formulations, are described in Remington's Pharmaceutical
Sciences, a standard reference text in this field, and in
the USP/NF.

The antibodies of the invention may also be used
in diagnostic methods for identifying patients infected
with RSV, using any of a number of standard assay systems
that are well known in the art. Such assay systems
include, but are not limited to, enzyme linked
immunosorbent assays (ELISA), solid phase radiometric
assays, immunofluorescent microscopy, and immunoelectron
microscopy (see, e.g., Coligan et al., eds., Current
Protocols in Immunology. John Wiley & Sons, New York,
1992). In these methods, an antigen (e.g., from a
biological sample, such as cellular material or
secretions) is contacted with an antibody of the
invention, and specific binding of the antigen and the
antibody is detected as a measure of the presence of the
antigen in the sample. The detection can be facilitated
by the presence of labels on the antibodies of the
invention or labels on secondary antibodies that
recognize the antibodies of the invention. The labels
that can be employed in these assays include, but are not
limited to, radio-labels, enzyme labels (e.g.,
horseradish peroxidase), biotin-labels, and
chemiluminescent labels, and are detected using standard
methods.

Genes encoding immunoglobulin variable regions
(both light (kappa and lambda) and heavy chains) isolated
using the methods of the present invention can be fused
to genes encoding immunoglobulin constant regions (e.g.,
human constant region genes) in order to make genes
encoding chimeric or isotype switched recombinant
antibodies (see, e.g., United States Patent Number
4,816,397, issued March 28, 1989; hereby incorporated by
reference; and Morrison et al., Proc. Natl. Acad. Sci.
USA 81:6851-6855, 1984; hereby incorporated by
reference). For example, in order to generate an IgA
antibody, the variable region can be fused to an α
constant region. Different types of constant regions
that can be used to make the chimeric antibodies of the
invention are well known in the art (see, e.g., Roitt et
al., eds. Immunology. Gower Medical Publishing, London,
1989), and can be isolated and cloned to make the
chimeric and/or isotype switched genes of the invention
using standard methods (see, e.g., Sambrook et al., eds.
Molecular Cloning: A Laboratory Manual. Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, New York,
1989; and Coligan et al., supra). In order to facilitate
cloning of the variable genes into vectors, preferably,
the PCR primers used in their amplification contain in
their 5' ends sequences recognized by restriction
endonucleases. Vectors into which the variable region
genes and chimeric genes of the invention can be cloned
include both plasmid and viral vectors. Preferred
vectors for use in the invention are expression vectors
containing appropriate heterologous heavy or light chain
constant genes. For example, genes encoding kappa light
chain and heavy chain variable regions can be cloned into
plasmids such as pUHWκ and pUHWγ1, respectively
(Weissenhorn et al., Gene 106:273-277, 1991, hereby
incorporated by reference). These plasmids contain a
combination of a heavy-chain enhancer and a µ-gene
promoter, as well as a polylinker into which genes
encoding variable regions can be inserted. Another
vector that can be used in the invention is the pING
expression vector (Chomczynski et al., Biochem. Biophys.
Res. Commun. 122:340-344, 1984, see below).

The genes of the invention are cloned into the
vectors so that the genes are operably linked to
appropriate promoter/enhancer sequences. Any promoter
that is capable of directing initiation of transcription
in a eukaryotic cell may be used in the invention. For
example, non-tissue specific promoters, such as the
cytomegalovirus (DeBernardi et al., Proc. Natl. Acad.
Sci. USA 88:9257-9261, 1991, and references therein),
mouse metallothionein I gene (Hammer et al., J. Mol.
Appl. Gen. 1:273-288, 1982), HSV thymidine kinase
(McKnight, Cell 31:355-365, 1982), and SV40 early
(Benoist et al., Nature 290:304-310, 1981) promoters may
be used. Preferred promoters for use in the invention
include those which direct expression in myeloma cells,
as are described above.

Expression of the genes encoding the chimeric
and/or isotype switched antibodies of the invention can
be achieved using any of a number of methods standard in
the art. For example, vectors, e.g., plasmids,
containing the HNK-20 variable region(s) fused to
heterologous constant regions can be transfected into a
cell in which the gene is expressed, either
constitutively or upon induction. The genes can be
transfected into the cells using any standard method in
the art, including, but not limited to, electroporation,
calcium phosphate precipitation, protoplast fusion, and
the use of viral vectors or lipids that are coupled with
the genes (see, e.g., Sambrook et al., supra). Any cell
in which the genes of the invention can be expressed to
produce the antibodies of the invention may be used.
Such cells include, but are not limited to, myeloma
cells, such as non-Ig-producing myeloma cells. As a
specific example, Sp2/0-Ag14 cells, which are murine
myeloma cells that do not secrete or synthesize any
immunoglobulin chains, can be used. Sp2/0-Ag14 cells can
be obtained from the American Type Culture Collection,
12301 Parklawn Drive, Rockville, Maryland 20852-1776, and
have been designated ATCC CRL 1581 (see also Ochi et al.,
Proc. Natl. Acad. Sci. USA 80:6351-6355, 1983). Other
cell lines that can be used in the invention include
P3X63Ag8.653 (ATCC CRL 1580) and Sp2/01-Ag (ATCC CRL
8006).

The antibodies of the invention can be produced by
culturing cells expressing the genes encoding them using
standard methods. The antibodies are then purified from
the cells and/or cell culture supernatants using standard
methods (see, e.g., Coligan et al., supra). The culture
conditions may be scaled-up using standard methods, in
order to generate large quantities of antibody. In
addition, the antibodies may be purified from ascites
using standard methods.

Experimental Results 

Design of the primers and probes for PCR amplification of
mouse VH and VL genes.

Most strategies used to amplify mouse VH and VL
genes generate fragments encoding variable regions that
are either truncated or contain mutations in their 5'
and/or 3' ends. We have developed a PCR method for
amplifying Fv (variable) genes that results in
preservation of the entire Fv sequence. The primers used
in this method hybridize to the 5' untranslated region
(the 5' primers) and to the intron downstream of the
rearranged J segment of the lambda, kappa, and heavy
chain genes (the 3' primers; Fig. 1).

DNA sequence comparisons of 5' untranslated
regions and introns downstream from J segments were
performed using data from Kabat (Kabat et al., Sequences
of Proteins of Immunological Interest, 5th Ed., U.S.
Dept. Health and Human Services, Washington, DC, 1991)
and the GenBank and European Molecular Biology Laboratory
(EMBL) libraries, and were further analyzed with the
Genetics Computer Group (GCG, Madison, WI) package
programs. Alignment of the DNA sequences of the 5'
untranslated region revealed that the polymorphism in
this region is globally identical to that of the leader
peptide sequences. However, a stretch of around 20
nucleotides immediately upstream of the start codon (ATG)
was observed to be more conserved throughout the
alignment. We therefore designed a series of 5' PCR
primers (18-mers) that terminate with the ATG sequence,
ensuring perfect matches of the 3' ends of each primer
with the template, as is critical for initiation of
consistent priming by Taq polymerase. A SalI restriction
site and 4 additional nucleotides, which facilitate
digestion by the restriction endonuclease, were added to
the 5' end of the primers in order to expedite cloning of
the PCR fragments. Complex sets of 5' primers were
synthesized consisting of 2 primers for lambda, 30
primers for kappa, and 42 primers for the heavy chain
genes, some of which primers contained inosine residues,
or were degenerate (Fig. 6).
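
The 5' primer layout described above (4 additional nucleotides, a SalI site, then an 18-mer that terminates with ATG) can be sketched in a few lines of Python. This is an illustrative sketch only: the function name is hypothetical, the 4-nucleotide prefix GATC is read from SEQ ID NO: 9 in the sequence listing rather than stated in the text, and degenerate or inosine-containing primers are not handled.

```python
SALI_SITE = "GTCGAC"  # SalI recognition sequence
PREFIX_5P = "GATC"    # 4 additional nucleotides (assumption: taken from SEQ ID NO: 9)


def build_5prime_primer(annealing_18mer: str) -> str:
    """Assemble prefix + SalI site + an 18-mer that ends with the ATG start codon."""
    core = annealing_18mer.replace(" ", "").upper()
    if len(core) != 18:
        raise ValueError("expected an 18-nucleotide annealing region")
    if not core.endswith("ATG"):
        raise ValueError("the annealing region must terminate with ATG")
    return PREFIX_5P + SALI_SITE + core


# Annealing region of SEQ ID NO: 9 as it appears in the sequence listing.
primer = build_5prime_primer("CTTGGTTTGT GAATTATG")
print(primer, len(primer))
```

Ending the primer on the ATG keeps its 3' terminus on a position that is invariant in every template, which is the property the text identifies as critical for consistent priming.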

In contrast, alignment of the 5' end of the intron
sequences downstream from the J segments did not reveal
significant polymorphism, so that 3' PCR primers specific
for each of the Jλ, Jκ, and JH segments were designed in
this intronic region. Single PCR primers were prepared
corresponding to the regions downstream from each J
segment, except for the JH3 segment, for which 2 primers
were needed because of DNA sequence polymorphism. The 3'
PCR primers are 18-mers, except for the primer downstream
of the JH1 segment which was designed as a 20-mer because
of the high A/T content of this region. A NotI
restriction site and 4 additional nucleotides were added
to the 5' end of these oligonucleotides in order to allow
directional cloning of the PCR fragments into expression
vectors (Fig. 6).
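
As a rough check of the primer architecture described above, the following Python sketch splits a primer into its 5' additional nucleotides, restriction site, and annealing region, and confirms that a 5'/3' primer pair carries two different sites, which is what makes the cloning directional. The helper name is hypothetical; the two example primers are SEQ ID NO: 9 and SEQ ID NO: 12 as transcribed in the sequence listing.

```python
SITES = {"SalI": "GTCGAC", "NotI": "GCGGCCGC"}  # recognition sequences


def split_primer(primer: str, enzyme: str):
    """Return (5' extra nucleotides, restriction site, annealing region)."""
    seq = primer.replace(" ", "").upper()
    site = SITES[enzyme]
    start = seq.find(site)
    if start < 0:
        raise ValueError(f"no {enzyme} site found in primer")
    return seq[:start], site, seq[start + len(site):]


five_prime = split_primer("GATCGTCGAC CTTGGTTTGT GAATTATG", "SalI")
three_prime = split_primer("ATCAGCGGCC GCAAGAAGCA TTAAAGCCAC", "NotI")

# Different sites at the two ends fix the orientation of the insert.
is_directional = five_prime[1] != three_prime[1]
print(five_prime, three_prime, is_directional)
```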

In order to ensure specificity of the PCR
amplifications and to better characterize the VH-D-JH and
VL-JL rearrangements, JL and JH-specific oligonucleotide
probes were designed. Sets of oligonucleotides (19-mers)
specific for each of the Jλ, Jκ, and JH segments, which
exhibit very low polymorphism, were synthesized. These
oligonucleotides were used as probes in both Southern
blot analysis of the PCR fragments, and Northern blot
analysis of total RNA extracted from mouse hybridoma
cells.

Specificity of the oligonucleotide probes.

The specificity of the oligonucleotide probes
homologous to each J segment in DNA-DNA hybridization was
demonstrated by Southern blot hybridization using a
variety of cloned VL and VH PCR fragments of known
sequences. The specificity of these probes was also
tested by hybridization to RNA on Northern blots. RNA
from the hybridoma cells 3G3 (IgM, λ) and HNK-20 (IgA, κ)
was hybridized with the Jλ, Jκ, and JH oligonucleotides
(Fig. 2). The Northern blot analysis revealed that
probes Jλ1, Jκ2, and JH3 generate signals at the expected
sizes for 3G3 and HNK-20, respectively. Hybridization of
HNK-20 RNA with the Jκ2 probe generated a much stronger
signal than the JH3 probe, although these two probes had
similar specific activities and identical calculated
melting temperatures. This observation suggests either
that the kappa chain transcript is much more abundant, or
that several kappa chain transcripts hybridize with the
Jκ2 probe. PCR amplification of the Vκ region supports
the hypothesis that there are several different kappa
chain gene rearrangements, one of which was derived from
a transcribed pseudogene rearranged with the Jκ2 segment
present in the immortalized fusion partner X63Ag8.653
(Strohal et al., Nucleic Acids Res. 15:2771, 1987;
Carroll et al., Mol. Immunol. 25:991-995, 1988). Thus,
at least one kappa pseudogene and one kappa functional
gene transcript, both rearranged with the Jκ2 segment,
are contributing to the strong signal observed. The
Northern blots were then rehybridized with a probe
specific for the house-keeping enzyme gene
glyceraldehyde-3-phosphate-dehydrogenase (Piechaczyk et
al., Nucleic Acids Res. 12:6951-6963, 1984) to control
for the amount of RNA loaded (Fig. 2).

Use of the above-described J-specific
oligonucleotide probes allows rapid characterization of
the PCR products and immunoglobulin gene transcripts, and
further allows identification of the J segment used in
the rearrangement of the light and heavy chain genes.

Amplification and detection of mAb variable region genes.

Several immunoglobulin variable region genes have
been amplified by PCR using genomic DNA prepared from
mouse hybridoma cells. As specific examples,
amplification of the Vλ gene of the hybridoma cell line
3G3, and of the Vκ and VH genes of hybridoma cell line
HNK-20, is described.

Based on the Northern blot analysis shown in Fig.
2, it was established that the λ chain gene was
rearranged with Jλ1, the κ chain gene was rearranged with
Jκ2, and the heavy chain gene was rearranged with JH3.
Therefore, the 3' primers required for amplification were
3'λ1, 3'κ2, and 3'H3a/3'H3b. Due to the high level of
polymorphism in the 5' untranslated region, if all of the
5' primers were used in separate PCR reactions, there
would be 2 PCR reactions for lambda, 30 PCR reactions for
kappa, and 42 PCR reactions for the heavy chain genes.
To reduce this complexity, the 5' primers were divided
into small groups of 2, 3, or 4 primers based on their
melting temperatures (Fig. 6). One group was required
for λ, 9 for κ, and 12 for the heavy chain gene primers,
reducing the number of reactions to 1, 9, and 12,
respectively.
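
The pooling arithmetic above can be illustrated with a short Python sketch. It is not the grouping shown in Fig. 6, which is not reproduced here; it simply assumes that primers are sorted by melting temperature and cut into a chosen number of groups of nearly equal size. The function names and the example melting temperatures are hypothetical.

```python
def group_sizes(n_primers: int, n_groups: int) -> list:
    """Split n_primers into n_groups whose sizes differ by at most one."""
    base, extra = divmod(n_primers, n_groups)
    return [base + 1] * extra + [base] * (n_groups - extra)


def group_by_tm(primers, n_groups: int) -> list:
    """primers: list of (name, melting temperature) pairs, pooled in Tm order."""
    ordered = sorted(primers, key=lambda p: p[1])
    groups, start = [], 0
    for size in group_sizes(len(ordered), n_groups):
        groups.append(ordered[start:start + size])
        start += size
    return groups


# Primer and group counts taken from the text: 2, 30, and 42 primers
# pooled into 1, 9, and 12 reactions.
for chain, n, k in [("lambda", 2, 1), ("kappa", 30, 9), ("heavy", 42, 12)]:
    print(chain, group_sizes(n, k))

# Hypothetical primers with made-up melting temperatures.
demo = [("p1", 52), ("p2", 60), ("p3", 54), ("p4", 58), ("p5", 56)]
print(group_by_tm(demo, 2))
```

With the counts given in the text, an even split yields groups of 2, 3, or 4 primers in every case, consistent with the group sizes stated above.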

The fragments produced in the first set of PCR
amplification reactions, as analyzed on Southern blots
hybridized with the Jλ1, Jκ2, and JH3 probes,
corresponded to the expected sizes in the 5'λ group a, in
the 5'κ groups g and i, and in the 5'H groups c and l
(Figs. 3A, 3B, 3C, and 3D). In the second set of PCR
reactions, each 5' primer from the positive groups
identified in the first set of PCR reactions was combined
with the corresponding 3' primer, and the amplified
fragments were analyzed on Southern blots using the Jλ1,
Jκ2, and JH3 oligonucleotide probes. This experiment
revealed that 5'λ1, 5'κ30, 5'κ16, 5'κ19, and 5'H31B
generate the predicted signals (Figs. 3E, 3F, and 3G),
indicating that rearranged fragments were found for one
lambda chain gene in 3G3, 3 kappa chain genes in HNK-20,
and one heavy chain gene in HNK-20 (Fig. 4). Using
another mouse hybridoma cell (PCG-4), secreting IgG2a
with a κ chain, 3 different rearrangements of the kappa
chain genes, and a single rearrangement of the heavy
chain gene, were observed using the same PCR strategy.
This observation is consistent with the presence of both
a functional Vκ and aberrant Vκ transcripts in hybridoma
cells obtained by fusion using cell lines derived from
the original MOPC-21 tumor (Strohal et al., Nucleic Acids
Res. 15:2771, 1987; Carroll et al., Mol. Immunol.
25:991-995, 1988). A third non-transcribed kappa chain
rearrangement was detected in both HNK-20 and PCG-4
hybridomas which involved different Jκ segments. Since
the only κ-chain gene that the fusion partner can
contribute is the non-functional MOPC-21 κ gene (Storb et
al., Nucleic Acids Res. 8:4681-4687, 1980), this third
rearrangement probably originated from the mouse B-cell
that served as a fusion partner.

Sequence analysis of the Vλ, Vκ, and VH PCR fragments.

The PCR amplified variable regions of 3G3 Vλ, HNK-20
Vκ, the HNK-20 Vκ pseudogene, and HNK-20 VH were each
cloned into the pING expression vector (Chomczynski et
al., supra) and sequenced using the dideoxy chain
termination method (Piechaczyk et al., Nucleic Acids Res.
12:6951-6963, 1984). The corresponding sequences are
shown in Figs. 5A-5D. The deduced amino acid sequences
of the Vλ, Vκ, and VH fragments correspond to open
reading frames consistent with those reported in the
various gene banks. In all three PCR amplified
fragments, the positions of the cysteines involved in
intramolecular disulfide bridge formation were conserved.
The presence of conserved stretches of amino acids
corresponding to framework sequences allowed positioning
of the CDRs (Fig. 5). Taken together, these observed
structural features indicate that the amplified and
cloned Fv fragments are functional and thus are able to
recognize their corresponding antigen, provided they are
expressed in appropriate host cells. Accordingly, the
murine κ and H Fv genes corresponding to the antibody
produced by the HNK-20 hybridoma have been inserted into
expression vectors containing human heavy and light chain
constant genes and transfected into myeloma cells.

Materials and Methods

Sources of DNA and RNA.

Mouse hybridoma cell line 3G3 (IgM, λ) was
deposited with the American Type Culture Collection, and
was assigned ATCC No. HB 8516. Mouse hybridoma cell line
HNK-20 (IgA, κ), from OraVax Inc., Cambridge, USA, was
deposited with the American Type Culture Collection
(ATCC, Rockville, Maryland) under the provisions of the
Budapest Treaty on July 1, 1993, and was assigned ATCC
No. HB 11394. Genomic DNA was prepared as described
(Gross-Bellard et al., Eur. J. Biochem. 36:32-38, 1973,
hereby incorporated by reference). Total RNA was
prepared from frozen cell pellets according to the
guanidine/cesium chloride method (Glisin et al.,
Biochemistry 13:2633-2637, 1974; Chirgwin et al.,
Biochemistry 18:5294-5299, 1979).

Synthesis of oligonucleotide primers and probes.

Oligonucleotides were synthesized using the standard
β-cyanoethyl phosphoramidite method
and reverse phase HPLC purification (MWG-Biotech,
Ebersberg, Germany).

DNA amplification by PCR.

In vitro DNA amplification (PCR) was performed in
a final volume of 100 µl using the thermal cycler
9600 from Perkin Elmer (Norwalk, CT). Reagents were
added to the reaction to yield the following final
composition: 10 mM Tris-HCl pH 8.3 (at 25°C), 50 mM KCl,
2.5 mM MgCl2, 0.001% gelatin (Sigma, Cat. No. G2500, St.
Louis, MO), 200 µM dNTPs, 150 nM of each amplification
primer, 1 µg of genomic DNA, and 2.5 units of AmpliTaq
DNA polymerase (Perkin Elmer, Norwalk, CT). The cycling
profile used is as follows: 5 minutes at 94°C; 3 cycles
of 1 minute at 94°C, 1 minute at 50°C, and 1 minute at
72°C; 27 cycles of 1 minute at 94°C, 1 minute at 62°C, 1
minute at 72°C; and a final incubation at 72°C for 10
minutes.
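
For readers who want the cycling profile in a machine-readable form, the following Python sketch encodes the stages given above and totals the programmed hold time. The stage names and the data structure are illustrative assumptions, not part of the original protocol, and instrument ramp times between temperatures are not included.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    name: str      # descriptive label (hypothetical, for readability only)
    cycles: int    # number of times the steps are repeated
    steps: tuple   # (temperature in deg C, minutes) pairs


PROFILE = (
    Stage("initial denaturation", 1, ((94, 5),)),
    Stage("cycles annealed at 50 C", 3, ((94, 1), (50, 1), (72, 1))),
    Stage("cycles annealed at 62 C", 27, ((94, 1), (62, 1), (72, 1))),
    Stage("final incubation", 1, ((72, 10),)),
)


def amplification_cycles(profile) -> int:
    """Count three-step (denature/anneal/extend) cycles."""
    return sum(s.cycles for s in profile if len(s.steps) == 3)


def hold_minutes(profile) -> int:
    """Sum programmed hold times, excluding ramping between temperatures."""
    return sum(s.cycles * sum(minutes for _, minutes in s.steps) for s in profile)


print(amplification_cycles(PROFILE), hold_minutes(PROFILE))
```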

Northern and Southern blot analysis.

Total RNA (10 ng) was denatured with glyoxal and
fractionated by electrophoresis on an agarose gel
(McMaster et al., Proc. Natl. Acad. Sci. USA 74:4835-4838,
1977). Transfer of RNA onto Gene Screen Plus
membrane (Dupont, Wilmington, DE) was carried out
according to the procedure described by the manufacturer.
PCR products and genomic DNA prepared from hybridoma
cells were fractionated on agarose gels and transferred
onto Gene Screen Plus membrane as described (Chomczynski
et al., Biochem. Biophys. Res. Commun. 122:340-344,
1984).

Northern and Southern blots were hybridized in
identical conditions with 32P-labeled oligonucleotide
probes specific for each of the Jλ, Jκ, and JH segments.
The temperatures used for prehybridization and
hybridization were TM-4°C, while the temperatures used
for washing were TM-2°C. The melting temperatures (TMs)
of the oligonucleotides were estimated by the formula: TM
= 4(G+C) + 2(A+T). The TMs of the J-specific
oligonucleotide probes are as follows: Jλ1 = 58°C, Jλ2 =
62°C, Jλ3 = 60°C, Jλ4 = 58°C, Jκ1 = 64°C, Jκ2 = 64°C, Jκ3
= 56°C, Jκ4 = 64°C, Jκ5 = 62°C, JH1 = 68°C, JH2 = 64°C,
JH3 = 64°C, and JH4 = 60°C. For both Southern and
Northern blots probed with oligonucleotide probes, the
following conditions were used. Prehybridization was
carried out for 3-4 hours in 2x SSC, 5x Denhardt's, 0.1%
SDS, and 5 mM EDTA. Hybridization was carried out for
14-18 hours in 5x SSC, 10x Denhardt's, 20 mM sodium
phosphate buffer (pH 7.0), 7% SDS, 100 µg/ml denatured
herring sperm DNA, and 32P-labeled oligonucleotide (10^6
cpm/ml). After hybridization, the membranes were washed
twice for 30 minutes in 3x SSC, 10x Denhardt's, 5% SDS,
and 70 mM sodium phosphate buffer (pH 7.0); and twice for
30 minutes in 1x SSC and 1% SDS.
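
The melting temperature estimate and the derived hybridization and wash temperatures given above can be computed as follows. This is a minimal Python sketch: the function names are hypothetical, and the example oligonucleotide is a made-up 19-mer, not one of the J-specific probes.

```python
def estimated_tm(oligo: str) -> int:
    """TM = 4(G+C) + 2(A+T), in deg C, for an oligonucleotide of A, C, G, T only."""
    seq = oligo.replace(" ", "").upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    if gc + at != len(seq):
        raise ValueError("only A, C, G, and T are handled")
    return 4 * gc + 2 * at


def blot_temperatures(tm: int) -> dict:
    """Prehybridization/hybridization at TM-4, washing at TM-2 (deg C)."""
    return {"hybridization": tm - 4, "wash": tm - 2}


# Hypothetical 19-mer with 13 G/C and 6 A/T positions.
example = "GCGCGCGCGCGCGATATAT"
tm = estimated_tm(example)
print(tm, blot_temperatures(tm))
```

For a 19-mer, this formula can only return even values between 38°C and 76°C, so the probe TMs listed above correspond to G+C counts of 9 (56°C) through 15 (68°C).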

As an internal control for total RNA loading,
Northern blots were hybridized with a
glyceraldehyde-3-phosphate-dehydrogenase cDNA probe
(Piechaczyk et al.,
Nucleic Acids Res. 12:6951-6963, 1984). For both
Northern and Southern blots probed with the cDNA probe,
the following conditions were used. Prehybridization was
carried out for 3-5 hours at 42°C in 50% deionized
formamide, 5x Denhardt's, 5x SSPE (30x SSPE = 4.5 M NaCl,
0.3 M NaH2PO4, 30 mM EDTA, pH 7.7), 1% SDS, and 200 µg/ml
denatured salmon sperm DNA. Hybridization was carried
out for 14-18 hours at 42°C in the same buffer including
32P-labeled cDNA probe (10^6 cpm/ml). After hybridization,
the membranes were washed twice for 5 minutes in 2x SSPE
at room temperature, once for 15 minutes in 2x SSPE and
0.5% SDS at 65°C, and once for 15 minutes in 0.5x SSPE at
65°C.

DNA sequencing.

DNA sequences were determined by subcloning the
L-VH-D-JH and L-VL-JL PCR fragments in the expression
vector pING (Liu et al., Gene 54:33-40, 1987), followed
by sequencing using the dideoxy chain termination method
(Sanger et al., Proc. Natl. Acad. Sci. USA 74:5463-5467,
1977).

Other Embodiments

The invention also includes any biologically
active fragment or analog of the genes and antibodies of
the invention. By "biologically active" is meant
possessing any activity which is characteristic of HNK-20
variable genes or chimeric antibodies containing them.
The invention also includes analogs of the HNK-20
immunoglobulin variable chains. Preferred analogs
include those with sequences which differ from the
sequences shown in Figs. 5B-5D only by conservative amino
acid substitutions, for example, substitution of one
amino acid for another with similar characteristics
(e.g., valine for glycine, arginine for lysine, etc.) or
by one or more non-conservative amino acid substitutions,
deletions, or insertions which do not abolish the
immunoglobulin's biological activity. Analogs of the
invention will generally exhibit at least 70%, preferably
80%, more preferably 90%, and most preferably 95%, or
even 99%, homology with a segment of 20 amino acid
residues, preferably 40 amino acid residues, or more
preferably the entire sequence of an immunoglobulin of
the invention. Alterations in the primary sequence
include genetic variants, both natural and induced. Also
included are analogs that include residues other than
naturally occurring or synthetic amino acids, e.g., β or
γ amino acids. Also included are immunoglobulins
modified by in vivo chemical derivatization, including
acetylation, methylation, phosphorylation, carboxylation,
or glycosylation.
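
The text does not specify how homology over a 20-residue segment is to be calculated. Purely as an illustration, the following Python sketch assumes ungapped percent identity between two peptides compared position by position, and reports the best-scoring window of a given length. The function names are hypothetical; the reference peptide is the first 20 residues of SEQ ID NO: 2 written in one-letter code, and the variant carries two made-up substitutions.

```python
def percent_identity(a: str, b: str) -> float:
    """Ungapped identity between two equal-length peptide strings."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def best_window_identity(reference: str, candidate: str, window: int = 20) -> float:
    """Highest identity over any same-offset window of the given length."""
    n = min(len(reference), len(candidate))
    if n < window:
        raise ValueError("sequences are shorter than the window")
    return max(
        percent_identity(reference[i:i + window], candidate[i:i + window])
        for i in range(n - window + 1)
    )


reference = "MAWISLILSLLALSSGAISQ"  # residues 1-20 of SEQ ID NO: 2
variant = "MAWISLIVSLLALSSGAITQ"    # hypothetical analog with two substitutions
print(best_window_identity(reference, variant))
```

An alignment-based measure that allows gaps or scores conservative substitutions would give different values; the simple identity used here is only one possible reading of the thresholds above.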

In addition to substantially full-length
polypeptides, the invention also includes biologically
active fragments of the immunoglobulins, which can be
made using standard methods in the art. As used herein,
the term "fragment," as applied to a polypeptide, such as
an immunoglobulin, will ordinarily be at least 20
residues, more preferably at least 40 residues in length.
Similarly, the invention also includes fragments of the
genes encoding the immunoglobulins of the invention.

It is understood that the examples and embodiments
described herein are for illustrative purposes only and
that various modifications or changes in light thereof
will be suggested to persons skilled in the art and are
to be included within the spirit and scope of this
application and the appended claims.

SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: Berdoz, Jose 

Kraehenbuhl, Jean Pierre 

(ii) TITLE OF INVENTION: PCR AMPLIFICATION OF REARRANGED GENOMIC
VARIABLE REGIONS OF IMMUNOGLOBULIN GENES

(iii) NUMBER OF SEQUENCES: 108

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson 

(B) STREET: 225 Franklin Street, Suite 3100 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110-2804 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30B

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/348,548 

(B) FILING DATE: 01-DEC-1994 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Clark, Paul T.

(B) REGISTRATION NUMBER: 30,162

(C) REFERENCE/DOCKET NUMBER: 06132/009001

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (617) 542-5070
(B) TELEFAX: (617) 542-5070
(C) TELEX: 200154



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 537 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GATCGTCGAC CGTGGTTTGT GAATTATGGC CTGGATTTCA CTTATACTCT CTCTCCTGGC 60 

TCTCAGCTCA GGTCAGCAGC CTTTCTACAC TGCAGTGGGT ATGCAACAAT GCGCATCTTG 120 

TCTCTGATTT GCTACTGATG ACTGGATTTC TCATCTGTTT GCAGGGGCCA TTTCCCAGGC 180 

TGTTGTGACT CAGGAATCTG CACTCACCAC ATCACCTGGT GAAACAGTCA CACTCACTTG 240 

TCGCTCAAGT ACTGGGGCTG TTACAACTAG TAACTATGCC AACTGGGTCC AAGAAAAACC 300

AGATCATTTA TTCACTGGTC TAATAGGTGG TACCAACAAC CGAGCTCCAG GTGTTCCTGC 360

CAGATTCTCA GGCTCCCTGA TTGGAGACAA GGCTGCCCTC ACCATCACAG GGGCACAGAC 420 

TGAGGATGAG GCAACATATT TCTGTGCTCT ATGGTACAGC AACCATTGGG TGTTCGGTGG 480 

AGGAACCAAA CTGACTGTCC TAGGTGAGTC ACTGGTCCCT CCTTTGCGGC CGCTGAT 537 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Trp Ile Ser Leu Ile Leu Ser Leu Leu Ala Leu Ser Ser Gly
1 5 10 15

Ala Ile Ser Gln Ala Val Val Thr Gln Glu Ser Ala Leu Thr Thr Ser
20 25 30

Pro Gly Glu Thr Val Thr Leu Thr Cys Arg Ser Ser Thr Gly Ala Val
35 40 45

Thr Thr Ser Asn Tyr Ala Asn Trp Val Gln Glu Lys Pro Asp His Leu
50 55 60

Phe Thr Gly Leu Ile Gly Gly Thr Asn Asn Arg Ala Pro Gly Val Pro
65 70 75 80

Ala Arg Phe Ser Gly Ser Leu Ile Gly Asp Lys Ala Ala Leu Thr Ile
85 90 95

Thr Gly Ala Gln Thr Glu Asp Glu Ala Thr Tyr Phe Cys Ala Leu Trp
100 105 110

Tyr Ser Asn His Trp Val Phe Gly Gly Gly Thr Lys Leu Thr Val Leu
115 120 125

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
GATCGTCGAC GGACTCAGCA TGGACATGAG GACCCCTGCT CAGTTTCTTG GAATCTTGTT 60
GCTCTGGTTT CCAGGTAAAA TGAACTAAAA TGGGAATGTC ACTGTGATTA GTGTTGATTG 120
GCATTTGGGA GATTTTATCT TTTATGATGC TTACCTATGT AGATACTCAT TATGTCTCCA 180
TTCCTAGGTA TCAAATGTGA CATCAAGGTG ACCCAGTCTC CATCTTCCAT GTATCCATCT 240
CTAGCAGAGA GAGTCACTAT CACTTGCAAG CCGAGTCAGG ACATTAATAA CTATTTAAAC 300
TCGTTCCAGC AGAAACCAGG GAAATCTCCT AAGACCCTGA TCTATCGTGC AAACAGATTG 360
CTAGATGGGG TCCCATCAAG GTTCAGTGGC AGTGGATCTG GGCAAGATTA TTCTCTCACC 420
ATCAGCAGCC TGGAGTATGA AGATATGGGA ATTTATTATT GTCTACAGTT TGACGAGTTT 480
CCGTACACGT TCGGAGGGGG GACCAAGCTG GAAATAAAAC GTAAGTAGTC TTCTCAACTC 540
TTGCGGCCGC TGAT 554
(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 127 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
Met Arg Thr Pro Ala Gln Phe Leu Gly Ile Leu Leu Leu Trp Phe Pro
1 5 10 15

Gly Ile Lys Cys Asp Ile Lys Val Thr Gln Ser Pro Ser Ser Met Tyr
20 25 30

Ala Ser Leu Gly Glu Arg Val Thr Ile Thr Cys Lys Ala Ser Gln Asp
35 40 45

Ile Asn Asn Tyr Leu Asn Trp Phe Gln Gln Lys Pro Gly Lys Ser Pro
50 55 60

Lys Thr Leu Ile Tyr Arg Ala Asn Arg Leu Leu Asp Gly Val Pro Ser
65 70 75 80

Arg Phe Ser Gly Ser Gly Ser Gly Gln Asp Tyr Ser Leu Thr Ile Ser
85 90 95

Ser Leu Glu Tyr Glu Asp Met Gly Ile Tyr Tyr Cys Leu Gln Phe Asp
100 105 110

Glu Phe Pro Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu Ile Lys
115 120 125

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 690 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GATCGTCGAC TTCCAGCTCT CAGAGATGGA GACAGACACA CTCCTGTTAT GGGTACTGCT 60

GCTCTGGGTT CCAGGTGAGA GTGCAGAGAA GTGTTGGATG CAACCTCTGT GGCCATTATG 120

ATACTCCATG CCTCTCTGTT CTTGATCACT ATAATTAGGG CATTTGTCAC TGGTTTTAAG 180 

TTTCCCCAGT CCCCTGAATT TTCCATTTCC TCAGAGTGAT GTCCAAAATT CTTCTTAAAA 240 

ATTTAAATCA AAAGGTCCTC TGCTGTGAAG TCTTTTATAC ATATATAACA ATAATCTTTG 300 

TGTTTATCAT TCCAGGTTCC ACTGGTGACA TTGTGCTGAC ACAGTCTCCT GCTTCCTTAG 360 

CTGTATCTCT GGGGCAGAGG GCCACCATCT CATACAGGGC CAGCAAAAGT GTCAGTACAT 420 

CTGGCTATAG TTATATGGCG TGGAACCAAC AGAAACCAGG ACAGCCACCC AGACTCCTCA 480 

TCTATCTTGT ATCCAACCTA GAATCTGGGG TCCCTGCCAG GTTCAGTGGC AGTGGGTCTG 540 

GGACAGACTT CACCCTCAAC ATCCATCCTG TGGAGGAGGA GGATGCTGCA ACCTATTACT 600 

GTCAGCACAT TAGGGAGCTT ACACGTTCGG AGGGGGGACC AAGCTGGAAA TAAAACGTAA 660 

GTAGTCTTCT CAACTCTTGC GGCCGCTGAT 690 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro
1 5 10 15

Gly Ser Thr Gly Asp Ile Val Leu Thr Gln Ser Pro Ala Ser Leu Ala
20 25 30

Val Ser Leu Gly Gln Arg Ala Thr Ile Ser Tyr Arg Ala Ser Lys Ser
35 40 45

Val Ser Thr Ser Gly Tyr Ser Tyr Met Ala Trp Asn Gln Gln Lys Pro
50 55 60

Gly Gln Pro Pro Arg Leu Leu Ile Tyr Leu Val Ser Asn Leu Glu Ser
65 70 75 80

Gly Val Pro Ala Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr
85 90 95

Leu Asn Ile His Pro Val Glu Glu Glu Asp Ala Ala Thr Tyr Tyr Cys
100 105 110

Gln His Ile Arg Glu Leu Thr Arg Ser Glu Gly Gly Pro Ser Trp Lys
115 120 125

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 546 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

GATCGTCGAC CTCAAGGTCC TTACAATGAA ATGCAGCTGG GTGATCTTCT TCCTGATGGC 60

AGTGGTTACA GGTAAGGAGC TCCCAAGTCC CAAACTTGAG GGGCCATACA CTCTGTGACA 120

GTGGCAGTCA CTTTGCCTTT CTTTCTACAG GGGTCAATTC AGAGGTTCAG CTGCAGCAGT 180

CTGGGGCTGA GCTTGTGAGG CCAGGGGCCT TAGTCAAGTT GTCCTGCAAA GCCTCTGGCT 240

TCAACATTAA AGACTACTAT ATGTACTGGG TAAAACAGAG GCCTGAACAG GGCCTGGAGT 300

GGATTGGATG GATTGATCCT GAAAATGGTA ATACTGTTTA TGACCCGAAG TTCCAGGGCA 360

AGGCCAGTAT AACAGCAGAC ACATCCTCCA ACACAGCCTA CCTGCAGCTC AGCAGCCTGG 420

CATCTGAGGA CACTGCCGTC TATTACTGTG CTTACTACGG TACTAGCTAC TGGTTTCCTT 480

ACTGGGGCCA AGGGACTCTG GTCACTGTCT CTGCAGGTGA GTCCTACCTT CTCCGCGGCC 540

GCTGAT 546



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Met Lys Cys Ser Trp Val Ile Phe Phe Leu Met Ala Val Val Thr Val
1 5 10 15

Asn Ser Glu Val Gln Leu Gln Gln Ser Gly Ala Glu Leu Val Arg Pro
20 25 30

Gly Ala Leu Val Lys Leu Ser Cys Lys Ala Ser Gly Phe Asn Ile Lys
35 40 45

Asp Tyr Tyr Met Tyr Trp Val Lys Gln Arg Pro Glu Gln Gly Leu Glu
50 55 60

Trp Ile Gly Trp Ile Asp Pro Glu Asn Gly Asn Thr Val Tyr Asp Pro
65 70 75 80

Lys Phe Gln Gly Lys Ala Ser Ile Thr Ala Asp Thr Ser Ser Asn Thr
85 90 95

Ala Tyr Leu Gln Leu Ser Ser Leu Ala Ser Glu Asp Thr Ala Val Tyr
100 105 110

Tyr Cys Ala Tyr Tyr Gly Thr Ser Tyr Trp Phe Pro Tyr Trp Gly Gln
115 120 125

Gly Thr Leu Val Thr Val Ser Ala
130 135

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GATCGTCGAC CTTGGTTTGT GAATTATG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GATCGTCGAC AGTAGTACCT GCATTATG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GATCGCGGCC GCAAAGGAGG AGGAGTTAC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

ATCAGCGGCC GCAAGAAGCA TTAAAGCCAC 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATCAGCGGCC GCAAGAAGCT TTGAAACTAC
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GATCGTCGAC AAATTCAAAK ACAMAAT 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GATCGTCGAC AAGACTCAGC CTGACATG 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
GATCGTCGAC AAGTTCAAAG ACAAAATG 
(2) INFORMATION FOR SEQ ID NO: 17 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
GATCGTCGAC AGACTCAGCC TGACATG 27



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
GATCGTCGAC AGCAGGGGGA GCAGGATG 28 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GATCGTCGAC AGGGAAAGTT TGAAGATG 28 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GATCGTCGAC ATACATCAGA CCAGCATG 28 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GATCGTCGAC ATCTAGYTCT CAGAGATG 28 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GATCGTCGAC ATGCATCACA CCAGCATG 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:
GATCGTCGAC CACCAAGTTC TCAGAATG 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GATCGTCGAC CAGAGCAGCA GGGACATG 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GATCGTCGAC CAGGGACAAG TGGGAATG 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:
GATCGTCGAC CATTCAGAAC TCAGCATG

WO 96/16974 PCT/US95/15716

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

GATCGTCGAC GCGAGTCAGA CCAGCATG 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
GATCGTCGAC GGACACAGTT TAGATATG 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GATCGTCGAC GGACTCAGCA TGGACATG 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GATCGTCGAC GGAGACGTTG TAGAAATG 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:
GATCGTCGAC GGATACACCA TCAGCATG 28
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

GATCGTCGAC GGCAARGGCA TCAAGATG 28 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:
GATCGTCGAC GGCAGRGGRA GCAAGAT 27 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GATCGTCGAC GGTCACAGCA CAAACATG 28 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:
GATCGTCGAC GGTTGCCTCC TCAAAATG
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
GATCGTCGAC GTTCATTTCC TCAAAATG 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GATCGTCGAC TATCAAGTTC TCAGAATG 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GATCGTCGAC TCTCAAGTTC TCAGAATG 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
GATCGTCGAC TCTTGTGAAT TAATCATG 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GATCGTCGAC TGAAAACACA CAGACATG 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GATCGTCGAC TGATAAAGCC AAGGAATG 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
GATCGTCGAC TGATCACACA CAGWCATG 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
GATCGTCGAC TTCCAGCTCT CAGAGATG 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:
ATCAGCGGCC GCAGAGASTT TGGATTCTAC 30
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
ATCAGCGGCC GCAAGAGTTG AGAAGACTAC 30 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
ATCAGCGGCC GCAGTTGAGC AAAAATGTAC 30 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
ATCAGCGGCC GCAAATGAGC AAAARTCTAC 30 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
ATCAGCGGCC GCAAGATGAG AAAAGTGTAC 30
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
GATCGTCGAC ACACAGACTC ACACCATG 
(2) INFORMATION FOR SEQ ID NO: 50:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
GATCGTCGAC ACACAGGACC TCACCATG 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
GATCGTCGAC ACACAGGATC TCACCATG 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
GATCGTCGAC ACACAGGGCA TTGCCATG 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:
GATCGTCGAC ACACTGACTC AAAACATG
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
GATCGTCGAC ACACTGACTC AAACCATG 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
GATCGTCGAC ACACTGACTC ACACCATG 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
GATCGTCGAC ACACTGACTC CAACCATG 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

GATCGTCGAC ACACTGACTC TAACCATG 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
GATCGTCGAC ACACTGACTC TCACCATG 28 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
GATCGTCGAC ACACTGACTT CACCATG 27 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
GATCGTCGAC ACATAGACTC TAACCATG 28 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
GATCGTCGAC ACATTGACTC AAACCATG 28 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:
GATCGTCGAC AGCCTCCATC AGAGCATG
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
GATCGTCGAC AGCCTCCGTC AGAGCATG 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GATCGTCGAC ATTATAACAT TGAACATG 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
GATCGTCGAC CAAGTCTTAG ACATCATG 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
GATCGTCGAC CACACATCCC TTACCATG 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:
GATCGTCGAC CACAGACACC TCACCATG 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
GATCGTCGAC CACAGACCMC TCACCATG 
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
GATCGTCGAC CACAGACCTG TCAACATG 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
GATCGTCGAC CACAGACCTG TCACCATG 
(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:
GATCGTCGAC CACGGAACCC TCACCATG
(2) INFORMATION FOR SEQ ID NO: 72:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72:
GATCGTCGAC CACGGACCCC TCACCATG 28 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
GATCGTCGAC CACGGACCCC TCACGATG 28 
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
GATCGTCGAC CACTCGACTC TAACCATG 28 
(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
GATCGTCGAC CACTGGTGTG CAGTCATG 28 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:
GATCGTCGAC CACTTCTTAG ACATCATG 
(2) INFORMATION FOR SEQ ID NO: 77:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
GATCGTCGAC CAGAGTCCAC TCRCCATG 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78:
GATCGTCGAC CCTGTCACTG ACTTCATG 
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
GATCGTCGAC CTCAAGGTCC TTACAATG 
(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:
GATCGTCGAC CTCCAGGTCC TTACAATG
(2) INFORMATION FOR SEQ ID NO: 81:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
GATCGTCGAC CTCAGTCCTG TCACCATG 28 
(2) INFORMATION FOR SEQ ID NO: 82:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:
GATCGTCGAC CTCAGTCCTG TCACTATG 28
(2) INFORMATION FOR SEQ ID NO: 83:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GATCGTCGAC GCAGAGGACC TCACAATG 28 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
GATCGTCGAC GCCTTTACAG ACTTCATG 28 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
GATCGTCGAC GGACCTCACC ATGGGATG 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
GATCGTCGAC GGGTGTTGCC TAAGGATG 
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
GATCGTCGAC GGTGTWGCCT AAAAGATG 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 
GATCGTCGAC GGTGTTGCCT AAAGGATG 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:
GATCGTCGAC GTTGTAGCCT AAAAGATG 28
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
GATCGTCGAC TCAGTCCTTG TCACTATG 28 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
ATCAGCGGCC GCAAAGAAAA AAGCCAGCTT AC 32 
(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
ATCAGCGGCC GCGAGGTTKT AAGGACTCAC 30 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
ATCAGCGGCC GCGGAGAART TAGGACTCAC 30 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:
ATCAGCGGCC GCGGAGAAGK TAGGACTCAC 
(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:
ATCAGCGGCC GCTGGAGAGG CCATTCTTAC 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GTCAGTTTGG TTCCTCCAC 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97:
GTGACCTTGG TTCCACCGC 
(2) INFORMATION FOR SEQ ID NO:98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98:
GTGACCTTGG TTCCACTGC 19
(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GTCAATCTGG TTCCACCTC 19 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
GTGCCTCCAC CGAACGTCC 19 
(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
GTCCCCCCTC CGAACGTGT 19 
(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
GTCCCATCAC TGAATGTGA 19 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
GTCCCCGAGC CGAACGTGA 
(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
GTCCCAGCAC CGAACGTGA 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
GACCGTGGTC CCTGCGCCC 
(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
GAGAGTGGTG CCTTGGCCC 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107:
GACCAGAGTC CCTTGGCCC 19
(2) INFORMATION FOR SEQ ID NO: 108:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
GACTGAGGTT CCTTGACCC 19 
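
Several primers in the listing above contain IUPAC ambiguity codes (for example Y in SEQ ID NO: 21 and R in SEQ ID NO: 33), so each such entry stands for a small pool of oligonucleotides rather than a single sequence. The Python sketch below is offered only as an illustration and is not part of the specification; the function names `normalize`, `expand_degenerate`, and `check_length` are hypothetical. It expands a degenerate primer into its concrete sequences and compares the observed length with the declared LENGTH field.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes (DNA only).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}


def normalize(seq):
    """Drop the spaces that group bases in tens and upper-case the rest."""
    return "".join(seq.split()).upper()


def expand_degenerate(seq):
    """Return every concrete sequence represented by a degenerate primer."""
    choices = [IUPAC[base] for base in normalize(seq)]
    return ["".join(combo) for combo in product(*choices)]


def check_length(seq, declared):
    """True when the sequence length matches the declared LENGTH field."""
    return len(normalize(seq)) == declared


if __name__ == "__main__":
    seq_21 = "GATCGTCGAC ATCTAGYTCT CAGAGATG"  # SEQ ID NO: 21, declared 28
    seq_33 = "GATCGTCGAC GGCAGRGGRA GCAAGAT"   # SEQ ID NO: 33, declared 27
    print(check_length(seq_21, 28), expand_degenerate(seq_21))
    print(check_length(seq_33, 27), len(expand_degenerate(seq_33)))
```

The same length check can be run over every entry of the listing to catch transcription errors in either the LENGTH field or the sequence itself.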
What is claimed is:

1. Substantially pure DNA encoding a variable
region of the antibody produced by hybridoma cell line
HNK-20.

2. The DNA of claim 1, wherein said variable
region is from the immunoglobulin heavy chain of said
antibody.

3. The DNA of claim 1, wherein said variable
region is from the immunoglobulin light chain of said
antibody.

4. The DNA of claim 1, further encoding an
immunoglobulin constant region.

5. The DNA of claim 4, wherein said
immunoglobulin constant region is a human immunoglobulin
constant region.

6. The DNA of claim 4, wherein said DNA encodes
an immunoglobulin α heavy chain.

7. The DNA of claim 1, wherein said DNA is 
genomic DNA. 

8. The DNA of claim 1, wherein said DNA is cDNA. 

9. Substantially pure DNA comprising the sequence
of Fig. 5B, or degenerate variants thereof, and encoding
the amino acid sequence of Fig. 5B.

10. Substantially pure DNA comprising a sequence
having about 50% or greater sequence identity to the DNA
sequence of Fig. 5B.




11. Substantially pure DNA which a) is capable of 
hybridizing to the DNA sequence of Fig. 5B under 
stringent conditions; and b) encodes a polypeptide having 
a biological activity of a HNK-20 variable region.

12. Substantially pure DNA comprising the
sequence of Fig. 5C, or degenerate variants thereof, and
encoding the amino acid sequence of Fig. 5C.

13. Substantially pure DNA comprising a sequence
having about 50% or greater sequence identity to the DNA
sequence of Fig. 5C.

14. Substantially pure DNA which a) is capable of 
hybridizing to the DNA sequence of Fig. 5C under 
stringent conditions; and b) encodes a polypeptide having 
a biological activity of a HNK-20 variable region. 

15. Substantially pure DNA comprising the
sequence of Fig. 5D, or degenerate variants thereof, and
encoding the amino acid sequence of Fig. 5D.

16. Substantially pure DNA comprising a sequence
having about 50% or greater sequence identity to the DNA
sequence of Fig. 5D.

17. Substantially pure DNA which a) is capable of 
hybridizing to the DNA sequence of Fig. 5D under 
stringent conditions; and b) encodes a polypeptide having 
a biological activity of a HNK-20 variable region. 

18. The DNA of claim 1, wherein said DNA is
operably linked to regulatory sequences for expression of
said variable region; and
wherein said regulatory sequences comprise a
promoter.

19. A vector comprising the DNA of claim 1.

20. A cell comprising the DNA of claim 1. 

21. A recombinant antibody comprising a variable
region from the monoclonal antibody produced by hybridoma
cell line HNK-20.

22. The recombinant antibody of claim 21, wherein
said variable region is from the immunoglobulin heavy
chain of said monoclonal antibody.

23. The recombinant antibody of claim 21, wherein
said variable region is from the immunoglobulin light
chain of said monoclonal antibody.

24. The recombinant antibody of claim 21, further
comprising a human immunoglobulin constant region.

25. The recombinant antibody of claim 21, further
comprising an immunoglobulin α heavy chain.

26. A method of making a recombinant antibody
comprising a variable region from the monoclonal antibody
produced by hybridoma cell line HNK-20, said method
comprising the steps of:

a. providing DNA encoding said variable region of
said monoclonal antibody, said DNA further encoding a
constant region;

b. expressing said DNA.

27. A method of making a recombinant antibody
comprising a variable region from the monoclonal antibody
produced by hybridoma cell line HNK-20, said method
comprising the steps of:

a. providing a cell containing DNA encoding said
variable region of said monoclonal antibody, said DNA
further encoding a constant region;

b. culturing said cell under conditions in which
said recombinant antibody is produced; and

c. purifying said recombinant antibody from said
cell or from the supernatant in which said cell was
cultured.

28. A method of isolating a nucleic acid
comprising a variable region of an immunoglobulin gene,
said method comprising the steps of:

a. providing genomic DNA comprising said
immunoglobulin gene;

b. providing a first set of primers, each of
which contains the sequence of a polymorphic variant of a
segment of the 5' untranslated region of said
immunoglobulin gene;

c. providing a second set of primers, each of
which contains the sequence of a polymorphic variant of a
segment of the intron 3' to the rearranged J region of
said immunoglobulin gene;

d. carrying out a polymerase chain reaction with
said genomic DNA and said first and said second sets of
primers;

e. identifying a set of primers from said
polymerase chain reaction that comprise sequences of said
immunoglobulin gene; and

f. amplifying said immunoglobulin gene with said
identified set of primers.

29. The method of claim 28, wherein said
immunoglobulin gene is from hybridoma cell line HNK-20.
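
Steps d and e of claim 28 amount to trying the 5' untranslated-region primers against the intron primers and keeping the pair that yields a product. As a rough illustration only, the Python sketch below does this in silico with exact string matching. The toy template, the primer strings, and the names `reverse_complement` and `find_amplifying_pairs` are all hypothetical, only the gene-specific part of each primer is modelled (no restriction-site tail), and a real reaction tolerates mismatches that this sketch ignores.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_amplifying_pairs(template, forward_primers, reverse_primers):
    """Return (forward, reverse, product_length) for each primer pair
    that would amplify the template under exact matching."""
    hits = []
    for fwd, rev in product(forward_primers, reverse_primers):
        start = template.find(fwd)
        if start < 0:
            continue
        # On the top strand the reverse primer's site reads as its
        # reverse complement and must lie downstream of the forward site.
        site = reverse_complement(rev)
        end = template.find(site, start + len(fwd))
        if end < 0:
            continue
        hits.append((fwd, rev, end + len(site) - start))
    return hits


if __name__ == "__main__":
    # Toy template: made-up 5' UTR segment, filler, made-up intron segment.
    template = "TTTT" + "ACACAGACTC" + "GGGGCCCCAA" + "GTAGAATCCA" + "AAAA"
    forward = ["ACACAGACTC", "CACGGACCCC"]
    reverse = [reverse_complement("GTAGAATCCA"), "GGGGGGGGGG"]
    for hit in find_amplifying_pairs(template, forward, reverse):
        print(hit)
```

Enumerating every forward and reverse combination mirrors the screening idea of the claim: the pair that produces a product identifies which polymorphic variants are present in the rearranged gene.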
Fig. 4a
FIG. 5A

(Nucleotide sequence with the encoded amino acid sequence shown beneath it.)

SEQ ID NO: 1 
SEQ ID NO: 2 
FIG. 5B



(Nucleotide sequence, SEQ ID NO: 3, with the encoded amino acid sequence, SEQ ID NO: 4, shown beneath it.)
FIG. 5C

c 

oxTctrrcaxcTTCcxocrc^ gac aca cxc acx ctc ere tta tcc gta ss SEQ ID NO: 5 

hcc ciu Thr ajp Thr Leu Leu Leu xrp v*i io SEQ ID mo: o 

etc ctc ctt: tec err cca c crcACAcrrcc^ACAACTcrrccATCCAACL-icxu icccca us 

Leu Leu Leu Trp Val Pro C 16 

TTATG ATACTCC ATCC L ' 11 11 1U 1 111 » C ATC ACTATAATTAGGCC ATTTCTCAL'IUU 1 1 1 1 AACTT 1 8 2 

TCCCCAgtCCCSTCAA'nTrCC AV r^CTC ACACTCATCTCCAAA Ani - 1 ' I A AAAATTT AAATC 2 49 

xaaac o :wt »u ^s. :i *i aa g ix. : : 1 1 atacatatataacaataaiv: 1 : 11 :u ; r i atcattccag 3 is 

CT TCC ACT CCT CAC ATT CTC CTC ACA CAC TCT CCT CCT TCC TTA CCT CTA 3S5 

ly Ser Thr Gly Asp Ile Val Leu Thr Gln Ser Pro Ala Ser Leu Ala Val 33

TCT CTC COC CAC ACC CCC ACC ATC TCA TAC ACG CCC ACC AAA ACT CTC .\CT 4 16 

Ser Leu Gly Gln Arg Ala Thr Ile Ser Tyr Arg Ala Ser Lys Ser Val Ser 50

ACA TCT CCC TAT ACT TAT ATC CAC TCC AAC CAA CAC AAA CCA CCA CAC CCA 467 

Thf Set Cly Tyi Set Tyr Mot Ilia Trv* Aan Clu Cln :-y.n Pro Cly Cln fro r.7 

CCC ACA CTC CTC ATC TAT CTT CTA TCC AAC CTA CAA TCT CCC CTC CCT CCC SIB 

Pro Arg Leu Leu Ile Tyr Leu Val Ser Asn Leu Glu Ser Gly Val Pro Ala 84

ACC TTC ACT CCC ACT CCC TCT CCC ACA CAC TTC ACC CTC ,*AC ATC CAT CCT 569 

Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Asn Ile His Pro 101

CTC CAC CAC CAC CAT CCT CCA ACC TAT TAC TZTT CAC CAC ATT A^V, CAC CTT 620 

Val Glu Glu Glu Asp Ala Ala Thr Tyr Tyr Cys Gln His Ile Arg Glu Leu 118

*r» <t- rm ran r,r,r. ryr* rC A ACC TCC aaa TAA aac oTAAOTAQTCTTCTCAACT 67 S 
Thr Arg Ser Clu Cly Cly Pro Ser Trp Lys ••• 



CTTOCOOCCOCTOAT 



12Z 
690 
FIG. 5D

eATCOTGSACCTCAAOOTCCTTACAATO AAA TCC ,ACC TCC CTC ATC TTC TTC CTC S 5 SEQ ID NO: 7 

Met Lys Cys Ser Trp Val Ile Phe Phe Leu 10 SEQ ID NO: 8

ATC CCA CTC CTT ACA C < ^'*"*^ G *CCTCCCAACTCCCAAACTTCAGCCCCCATACACTCTCT US 

Met Ala Val Val Thr G 15

CACACTCCCACTCAi. 1 1 1LU.1 . U. U IVrACAC CC CTC AAT TCA CAC CTT CAC CTC 173 

ly Val Asn Ser Glu Val Gln Leu 23

CAC CAC TCT CSS CCT CAC CTT CTC ACC CCA CCC CCC TTA CTC AAC TTC TCC 224 
Gln Gln Ser Gly Ala Glu Leu Val Arg Pro Gly Ala Leu Val Lys Leu Ser 40

TCC AAA CCC TCT CCC TTC AAC ATT AAA CAC TAC TAT ATC TAC TCC CTA AAA 275 

Cy, Ly, Ala S.r Cly n r,. , ^ ^ ^ ^ ^ 

CAC ACC CCT CAA CAC CCC CTC CAC TCC ATT CCA TCC ATT CAT CCT CAA AAT 32S 
Cin Arg Pro Clu cin Cly Lau clu Trp :i. cly r, ,„ 

CCT AAT ACT CTT TAT CAC CCC AAC TTC CAC CCC AAC CCC AST ATA ACA CCA 377 
C1Y Mn T> ' Y" Tyr Aap fro Ly. Pha cm Cly «.,, Ala Sar II* Tnr Ala 91 

CAC ACA TCC TCC AAC ACA CCC TAC CTC CAC CTC ACC ACC CTC CCA TCT CAC 428 
Asp Thr Ser Ser Asn Thr Ala Tyr Leu Gln Leu Ser Ser Leu Thr Ser Glu 108

CAC ACT CCC CTC TAT TAC TCT CCT TAC TAC CCT ACT ACC TAC TCC TTT CCT 479 

Aap TT-r Ala vai Tyr Tyr cy* Aia Ty r tv- ci„ t». ^ n 125 

TAC m **r «~ ~. ACT CTC TCT CCA C OTOAOTCCTACCTTCTC S33 

Tjet Trp Cly cm Cly TOr Lau Tnr v.m S#r Al4 

COOaOCCOCTOAT 

S4S 
FIG. 6A



A) PCR primers for Vλ :

(*) 5'λ1 : CATCCTCCACCITGC TITO rGAATTATG SEQ ID NO: 9

(4) 5'λ2 : C ATCCT V CC A C AGTAGTACCTGCATTATG SEQ ID NO: 10

3'λ1 : GATCCCCCCCCCAAACGAGGIAGGAGTIAC SEQ ID NO: 11

3'λ2 : ATCAGCCCCCCCAAGAACCATTaAAGCCaC SEQ ID NO: 12

3'λ3 : ATCAG C G G C C C C AAG AAG C1TIU A AACTAC SEQ ID NO: 13


FIG. 6B 










B) PCR primers for Vκ :

(a) 5'κ1 : GATCGTCGACAAATTCAAAG/TAOAA/CAAT SEQ ID NO: 14

(0. 5'κ2 : GATCGTCGACAACACTCAGCCTG ACATG SEQ ID NO: 15

(a) 5'κ3 : O ATCGTC G A C AAtTITCAA AC A C AAAATC SEQ ID NO: 16

(d) 5'κ4 : CATCGTCCACAnACTrAnrrnnACATn SEQ ID NO: 17

(h) 5'κ5 : CATCGTCGACAGCACCCGCACCACGATC SEQ ID NO: 18

(b) 5'κ6 : GATCGTCGACAGGCAAAGTITGAAGAI G SEQ ID NO: 19

(d) 5'κ7 : GATCnTCCACATACATrAr.ArrAnCATn SEQ ID NO: 20

(d) 5'κ8 : GATCCTCGACATCTAGOTTCTCAGAGATG SEQ ID NO: 21

(0 5'κ9 : GATCCTCGACATGCATCACACCAGCATC SEQ ID NO: 22

(d) 5'κ10: GATCGTCCACCACCAAGTTCTCAGAATG SEQ ID NO: 23


(h) 5'κ11: GATCGTCCACCAGAGCAGCAGGGACATG SEQ ID NO: 24

00 5'κ12: GATCGTCGACCAGGGACAAGT'CGGAATG SEQ ID NO: 25

(e) 5'κ13: C ATCGTCG A CCATTCAG AACTCAGCATG SEQ ID NO: 26

(h) 5'κ14: GATCCTCGACGCGAGTCAGACCAGCATG SEQ ID NO: 27

(b) 5'κ15: GATCGTCGACGCACACAGTITAGATATG SEQ ID NO: 28

(i) 5'κ16: GaTCCTCCACGGaCTCaGCATCGACATG SEQ ID NO: 29

(d) 5'κ17: GATCCTCCACCGAGACCTTGTAGAAATG SEQ ID NO: 30

(0 5'κ18: CATCCTCCACCGATACACCATCaGCaTG SEQ ID NO: 31

(i) 5'κ19: GATCCTCGACGCCAAA/GGGCaTCAAGaTC SEQ ID NO: 32

(i) 5'κ20: GATCCTCCACGCCACG/TCGA/GaGCAAGaT SEQ ID NO: 33


(a) 5'κ21: GaTCCTCCACGCTCACAGCaCAAACATG SEQ ID NO: 34

(S) 5'κ22: GATCGTCGACGGTTGCCTCCTCAAAATG SEQ ID NO: 35

(b) 5'κ23: UATCCrrCUAGCnTGATrrCCrCAAAATG SEQ ID NO: 36

(*) 5'κ24: G ATCGTCG A CTaTCA AGTTCIC AG A ATG SEQ ID NO: 37

(c) 5'κ25: G ATCGTCG ACTCTCAAGTTCTCAG AATG SEQ ID NO: 38

U) 5'κ26: G ATCGTCG AClCTfCTGAATlAATCATG SEQ ID NO: 39

(c) 5'κ27: G ATCGTC G A CTG AAA AC AC AC A G AC ATG SEQ ID NO: 40

(c) 5'κ28: GATCCTCGACTCATaaaGCCAAGGAATG SEQ ID NO: 41

(c) 5'κ29: G ATCCTCG A CTG ATCA C ACAC AG A/TCaTG SEQ ID NO: 42

(g) 5'κ30: CI ATCGTCC A CTTCC AGCTCTOaO AG ATC SEQ ID NO: 43


3'κ1 : ATCACCCCCCGCaGaGAG/CI l"l GG ATTCTaC SEQ ID NO: 44

3'κ2 : ATCACCCGCCCCAAGACrrUAGAAUACrAC SEQ ID NO: 45

3'κ3 : ATCACCCCCCCCAGTTGACCAAAAATGTAC SEQ ID NO: 46

3'κ4 : ATCAG CGG CCG CA A ATG AGCA A AAA/GTCTAC SEQ ID NO: 47

3'κ5 : aTCaCCCCCCCCAAGATGaGAAAAGTCTaC SEQ ID NO: 48
FIG. 6C

C) PCR primers for VH :

0) 5'H1 : gatcgtcgacacacagactcacaccatg SEQ ID NO: 49

(t) 5'H2 : CATCCTCCACACACACCACCTCACCATC SEQ ID NO: 50

Q> 5'H3 : GATCCTCGACACACAGGaTCTCACCaTC SEQ ID NO: 51

(g) 5'H4 : GATCCTCGACACACAGGCCaTTGCCATG SEQ ID NO: 52

(b) 5'H5 : C ATCGTCCA C ACACTG ACTCAAAACaTG SEQ ID NO: 53

(c) 5'H6 : CATCCTCCACACACTGACTCAAACCATG SEQ ID NO: 54

0) 5'H7 : G ATCGTCCA C ACACTG ACTCaCACCATC SEQ ID NO: 55

(j) 5'H8 : GATCCTCUACACACTCACITXAACCATG SEQ ID NO: 56

fc> 5'H9 : G ATCGTCCA CAOACTGAi. u. l AACCATG SEQ ID NO: 57

(k) 5'H10: G ATCGTCCA C ACACTG ACTCTCACCATG SEQ ID NO: 58

fcc) 5 11 1 l^OA I (-UTCOACACAC IXj AC I HuACCA PC SEQ ID NO: 59

Oi) 5'H12: G ATCCTCG A C AC ATAGACTCTA ACCATG SEQ ID NO: 60


(o) 5'H13: CaTCCTCGACACATTCACTCAAACCATG SEQ ID NO: 61

5'H14: uATCGTCGACAGCCiuCATCAGAGCATG SEQ ID NO: 62

tei 5'H15: UAir.CTCCACAuwUK.Lii 1 C. AQ AuCATC SEQ ID NO: 63

1*1 5'H16: UAiCLiTCGACATTATAACAriljAAwATG SEQ ID NO: 64

W 5'H17: L» A 1 UUTUbALtAAU iUl i AUALA iCATG SEQ ID NO: 65

(k) 5'H18: GATCGTCGACCACACATCCCTTACCATG SEQ ID NO: 66

Ch) 5'H19: GATCGTCCACCACAGACACCTCACCATG SEQ ID NO: 67

(e) 5'H20: GATCGTCCACCACAGACCA/CCTCACCATG SEQ ID NO: 68

(k) 5'H21: GATCGTCGACCACAGAC<. l(i 1LAACATG SEQ ID NO: 69

(h) 5'H22: G ATCGTCCA CCACAGACCTCTC ACCATG SEQ ID NO: 70

(e) 5'H23: G ATCGTCCA CCACGGAACCCTC ACCATG SEQ ID NO: 71

(0 5'H24: GATCCTCCACCACCGACCCCTCACCATG SEQ ID NO: 72


CO' 5'H25: GATCCTCGACCACGGACCCCTCACGATG SEQ ID NO: 73

(k) 5'H26: GATCCTCCACCACTCGACTCTAACCATG SEQ ID NO: 74

(h) 5'H27: G ATCCTCG A CC ACTGGTGTGC AGTCATG SEQ ID NO: 75

(*) 5'H28: G ATCGTCCA CCACTTCIT AG ACATCATG SEQ ID NO: 76

CO 5'H29: G ATCGTCCA CCAGAGTCCACTCa/GCCATG SEQ ID NO: 77

(t) 5'H30: CATCCTCGACCCTCTCACTCACTTCATG SEQ ID NO: 78

(c) 5'H31a: CATCCrrCCACCTCAAGG^CCTTACAATG SEQ ID NO: 79

m 5'H31b: GATCCTCCACCTCCAG<;iCCl lACAATG SEQ ID NO: 80

0) 5'H32: GATCCTCGACCTCVGTCCTGTCACCATG SEQ ID NO: 81

(1) 5'H33: G ATCCTC G A CCrCACTlCCrGTC ACT ATG SEQ ID NO: 82

Ci) 5'H34: G ATCCTCG A CGCAGAGGACCTCACA ATG SEQ ID NO: 83


(d) 5'H35: GaTCC^CGACGCCTTTaCaCaCTTCATG SEQ ID NO: 84

(0 5'H36: GaTCCTCCACGCACCTCaCCATCCCATG SEQ ID NO: 85

(0 5'H37: C ATCCTCG ACCCG Ui I UiCCTAaGGATG SEQ ID NO: 86

(d) 5'H38: GaTCCTCGaCGGTCTA/TGCCTAAAaGATG SEQ ID NO: 87

(1) 5'H39: CA1VGTCGACGG1VITCCLTAAACCATG SEQ ID NO: 88

(i) 5'H40: CATCCTCCACGTrGTAGCCTAAAAGATG SEQ ID NO: 89

(<J) 5'H41: GATCCTCCACTCAGTCCl ICICACTATG SEQ ID NO: 90

3'H1 : ATCACCCCCCCCaaaGAAAAAAGCCAGCTTaC SEQ ID NO: 91

3'H2 : aTCaCCCCCCCCGAGG II I/O I A AGG ACTC A C SEQ ID NO: 92

3'H3a: ATCAC CCG CCC CGG AG A A AyCTTAGG ACTCAC SEQ ID NO: 93

3'H3b: ATCACCCCCCGCGGaGAAGT/GTAGGACTCaC SEQ ID NO: 94

3'H4 : ATCACCCCCCCCTCCAGAGGCCAnCI I AC SEQ ID NO: 95
FIG. 6D

D) Jλ oligo probes :

Jλ1 : GTCaCTTTCCTTCCTCCaC SEQ ID NO: 96

Jλ2 : GTCACCTTCCTTCCACCCC SEQ ID NO: 97

Jλ3 : GTCACCTTGGTTCCACTCC SEQ ID NO: 98

Jλ4 : GTCAATCTGGTTCCACCTC SEQ ID NO: 99
FIG. 6E 

E) Jκ oligo probes :

Jκ1 : GTGCCTCC ACCG AACGTCC SEQ ID NO: 100

Jκ2 : GTCCCCCCTCCGAACGTGT SEQ ID NO: 101

Jκ3 : GTCCCaTCACTGAATGTGA SEQ ID NO: 102

Jκ4 : GTCCCCGACCCGAACGTGA SEQ ID NO: 103

Jκ5 : GTCCCAGCACCGAACGTCA SEQ ID NO: 104

FIG. 6F 

F) JH oligo probes : 

JH1 : GACCGTGCTCCCTGCGCCC SEQ ID NO: 105

JH2 : GAGAGTGGTGCCTTGCCCC SEQ ID NO: 106

JH3 : GACCAGAGTCCCTTGGCCC SEQ ID NO: 107

JH4 : GACTGaGGTTCCTTCaCCG SEQ ID NO: 108